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ABSTRACT 

Neuropilin2 is a family of receptor protein involved in 
endocrine development in Breast and ducts. The 
bHLH regulates transcription of Neuropilin2 (NRP2) 
of progenitor ductal and endocrine cells to a 
neurogenesis and neurogen specific transcriptional 
mechanisms that mediates different signaling 
pathways. The differentiated genes such as RAB40A, 
FGFR1, TPM1, NRP2, and CLMN genes regulating 
the development of islet of endocrine and ductal cells, 
but the molecular mechanism and classification of 
gene expression is remain unknown. There are several 
transcriptional gene mutations may regulate 
transcription of islet cells of ductal and endocrine 
regions of the pancreas and intestine that may lead to 
cancer. However, our knowledge of microarray data 
analysis methods helps to classify the genes 
associated with differential and undifferential 
endocrine lineage, ductal cell and exocrine regions 
determine neurogenesis and neuron specific signaling 
pathways. Using Meta analysis of statistical rank 
correlation algorithm to rank the genes based on gene 
signatures. The reveal predicts 154 (38%) genes that 
were consistently and significantly up regulated 247 
(62%) were down regulated in Breast cancer. Using 
functional annotation of gene clusters reveals only 47 
genes is presumably associated with neuronal 
development and cell differentiation. Furthermore, 
this experiment helps to understand candidate genes 
for novel biomarkers identification, diagnosis and 
therapeutic approaches to Breast cancer. 
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1. INTRODUCTION 

Breast cancer (BC) is the fourth leading cause of 
cancer deaths [1] among women, being responsible 
for 6% of all cancer-related deaths and it is very 
difficult to diagnose in its early stages. At the time of 
diagnosis, 26% have survival rate, past 5 years only 
21.5% of survival rate has been recorded [2], With 
non-specific symptoms 90% of patients are diagnosed 
for surgery in advanced stage. Prior to this study, 
nearly 95% of BC is associated with hereditary factors 
that may greater risk of BC [3], The deadly nature of 
BC stems from its tendency to rapidly spread to the 
lymphatic system and distant organs. There are 
various types of Breast cancer include 78% are ductal 
carcinomas of the ductal epithelium. Only 5% of 
tumors of the breast are benign. 

In most patients, the BC is observed in both exocrine 
and endocrine regions, however the biological 
phenomenon shows the exocrine region release 
insulin- producing beta cells during embryogensis and 
endocrine region regulates glucose tolerance [4], The 
pathological association of diabetes in exocrine region 
and Breast cancer is in the endocrine region [5,6]. 
During the development of progenitor cells, which 
express specific genes Foxal, Foxa2, Pdxl, Pbxl, 
Hesl, Ptfla, Ngn3, HNF6, Pax4, NeuroDl, Nkx2.2 
and Sox9, give escalate to express in both the ductal, 
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exocrine and endocrine Breast cell development [7], 
In cell division, the inactivation of genes that may 
transcription ally regulate embryogenesis of neuronal 
cells it may leads to cancer [8]. The Neuropilin2 
(NRP2) controls a complex of gene regulatory 
transcriptional networks in endocrine progenitor cells 
which regulate insulin secreting islets of Langerhans 
[9], The relationship of exocrine and endocrine 
development of multi potent intracellular genes that 
may limit the expression during precursors of ductal 
polypeptide producing cells. 

The experimental results suggest that there are other 
transcriptional signaling pathways play important 
regulatory roles during BC development and the 
molecular mechanism are unknown. There are various 
techniques to comprehend the mechanism of 
molecular and genes responsibility for intra cellular 
signaling pathways. Using gene expression program 
determines the characterization of NRP2-mediated 
duct and endocrine cell mediated signaling pathways. 
The studies generated large set of adenoviruses 
expressing NRP1 and NRP2 of genome wide mRNa 
profiling data is revealed the expression analysis to 
reprogram ngn3 expression on Breast cancer gene 
analysis [11], 

The objective of the study is to understand the disease 
etiology of novel candidate gene expression in disease 
progression, cell cycle, neuronal development and cell 
differentiation. We used gene expression dataset to 
understand the characteristic NRP-2 mediated mRNA 
profiling within duct, exocrine and endocrine cell 
functional genes that may involved in BC and 
diabetes mellitus. We evaluate the transcriptional 
regulation of genes that may involved in expression 
with [3-islets within endoderm, that may helps to 
synthesize insulin, neuronal development mediated 
gene signaling networks that may helps to synthesize 
neuronal cells to form interconnection between 
mammary alveoli and ducts. In these we temporarily 
targets NRP-2 induced Breast with P-islets within 
endocrine cells, the genes that differentially expressed 
in both drug targets and disease conditions are 
typically predicts for molecular biomarkers. The 
biomarkers helps to identify the genes associated with 
neuronal cell development of pancreas along with 
other types of diseases also. 

2. Materials and Methods 
2.1 Disease dataset selection 

The data used in this study is publicly available from 


Gene expression Ominibus (GEO) database (GEO 
dataset: GSE67301) [11]. The dataset contains 37 
breast tissue samples were collected under ultrasound 
guidance from patients with stage three breast cancer 
before four cycles of whole breast hyperthermia. Gene 
expression analysis was done using Affymetrix U133 
Plus 2.0 Gene Chip arrays. 

The advantage of using this data is to understand the 
expression of NRP2 mediated human duct cells is 
recombinant with adenovirus and the induction of 
NRP2 expressions with different time intervals. The 
different time intervals is helps to predict the 
differential expression of NRP2 in duct, endocrine 
and exocrine cell differentiation in breast. 

2.2 Normalization of gene expression dataset 

The dataset is from a gene expression levels of Breast 
cancer samples from homosapiens, the details of study 
is given in Cao et.al (2012). The dataset contains 7 
controls of human ductal cells is transfected with 
AdGFP control vector biological samples and 7 
treated human duct cell after transduction with Ad- 
NRP2-GFP vectors in 3, 14 and 20 days time series. 
The samples is labeled with Cy3 (green dye) of 
control and Cy5 (red dye) of treated samples is 
hybridized with Affymetrix HG133A samples. 
However, the preprocessing and normalization 
procedures is implemented using Bioconductor 
packages [12] to remove systematic variance and 
prepare datasets for further analysis. 

The preprocessing of raw data using bioconductor 
tools to make over statistical metrics to predict quality 
of outlier data. The GCRMA (Gene-Chip Robust 
Multiarray Average) [13] algorithm include noise and 
non-specific binding (NSB) data calculation to 
optimize background intensities that adjust probe 
intensities to expression measurements and the 
normalized data is summarized using Robust 
Microarray Analysis (RMA) algorithm [14], The raw 
signal intensities of randomly retained datasets are in 
position of the PM and MM for every probe pairs, the 
MM probes introduces more background noise on a 
raw intensity scale to correct for NSB. The log2 
transformation of two-color arrays convert data to a 
linear scale value of each background corrected PM 
probe is obtained and these values are normalized 
using Q function. The RMA and GCRMA packages 
helps for only PM signals of background corrections. 
Thus the normalization techniques used only for 
conjunction with (pmonly) PM correction method. 
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We used Li-Wong procedure to normalize arrays 
using invariant set of genes and then fit probe set data 
to parametric model [15]. 


transcripts of differentially regulated NRP2 are 
unpaired with P<0.05 of up and down regulations is 
calculated using R and Bio Conductor. 


Differential gene expression of NRP2 in Breast data 
After normalization and preprocessing the resultant 
data is used for statistical analysis to predict the 
differential gene expression. Clustering algorithms 
such as hierarchical [16] and K-means clustering [17] 
or self- organizing maps (SOM)[18] used to generate 
partial solutions of single factors. Hiearchical 
clustering specifically describes the differential gene 
expression based on distance matrix are connected by 
a series of branches (clustering tree or dendrogram). 
This method helps to classify gene expression 
analysis of complete linkage outforms. K-means 
clustering algorithm predicts the best fitting between 
clusters and their representation using a predefined 
number of clusters. The prototype of randomly 
selected datasets of lowest dissimilarity and represents 
smallest cluster variance. 

2.3 Evaluation of functional Annotation 

The clustering data of functional data is used for GO 
annotation of the genes extracted from Affymetrix 
HG133A annotation file. The Database for 
Annotation, Visualization and Integrated Discovery 
(DAVID) Gene Functional Classification Tool (http:// 
david.abcc. ncifcrf.gov) [19] used to identify list of 
genes associated with biological terms into organized 
classes. The organizes of significant genes shared 
specific terms by <5% of the genes paired with 
co mm on GO terms that functionally related. Genome 
enrichment analysis is predicted using Gorilla 
(http://cbl-gorilla.cs. technion.ac.il/) [20] to get gene 
rank list. 

3. Results and Discussion 

3.1. Experimental analysis of NRP2 over 
expression in Breast cell lines 

How NRP2 is expressed differentially in both 
exocrine, endocrine and ductal cells. We performed 
an expression analysis of 14 datasets of NRP2 
mediated reprogrammed datasets is annotated with 
Affymetrix Human Genome U133A of adult Breast 
duct cells of expression datasets. The isolates of 
NRP2 deficient Breast duct cells that are abundant 
expression in islet endocrine and neural tissues. There 
are 14 experimental datasets of control vectors of 
mRNA transcriptional gene factors data of 3, 14 and 
20 days is compared with NRP2 expression of NRP2 
cell cultures of datasets from GEO database. The 


The differentially expressed dataset contains 22285 
genes, the preprocessing and normalization is carried 
out using simple affy statistical software package to 
calculate thresholds of raw data. The normalization of 
expression values is calculated using RMA is close 
with GCRMA of standard processing of expression 
values, the differential gene expression of dataset is 
treated with NRP2 expression array of treated datasets 
is presented in principal component analysis (PCA) of 
3, 14 and 20 days datasets from both early and late 
stage of gene expression (Figure: 1). The incremental 
analysis of repeated genes across independent datasets 
is biologically active that filtering out differentially. 


We determine the differential expressions of Breast 
cancer genes were independently classified. Using 
fold change (FC) cutoff of 14 independent datasets 
and a FDR-corrected P-values (P<0.05) of 15410 
genes were filtered from background corrections. 
Using functional enrichment and gene ontology to 
classified 11721 genes were differentially expressed 
in overall dataset. The Pearson's Correlation 
coefficient of hierarchical clustering shows the 
differentially expressed genes is aligned based on 
Euclidean distance. The highly significant Euclidean 
distance with poor correlations did not show 
statistically significance with correlations of series of 
outcomes (fig 2). A set of 400 differentially expressed 
genes of significant (P <0.05) thresholds of well 
defined clusters is used for linkage analysis. 
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Figurel. Principle component analysis of 14 datasets 
including NRP2 data analysis 
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3.2 Identification of up regulated genes 

We used genet Classifier to classify significant genes 
of differential expression at <0.95 threshold shows 
182 up regulated, 229 down regulated. The initial 
screening is carried out with all pairwise z-vlues 
associated with significant correlation of up regulated 
and down regulated datasets. Using false positive 
prediction of 46 gene ranks were up regulated at two 
folds, 65 genes are also significantly expressed in 
precisely explained Breast cancer (fig: 3a, 3b). 
Supplementary Table 1 provides the up regulated and 
down regulated genes have well established and 
significantly associated with lack of neuronal cell 
development in Breast cell development. Some well 
kn own genes include RAB40A,FGFR1,TPM1,NRP2, 
and CLMN genes is down regulated in Breast duct, 
endocrine and exocrince cells and is less effective to 
synthesize neuronal cell development and 
differentiation into insulin production. Hence, these 
genes is potential targets to Breast cancer. 
RAB40A,FGFR1,TPM1,NRP2, and CLMN genes is 
significantly down regulated genes in Breast cancer 
on neuronal cell development, but these genes directly 
associated with other types of cancer such as 
pancreatic cancer, ovarian cancer and other cancer 
types. 
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Figure2. Hierarchical clustering of 400 differentially 
expressed significant genes predicted with p<0.05 
threshold. 
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Figure3a. Differential expression of case and control 
dataset genes were predicted with significance of 
<0.05 threshold. 
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Figure3b. Differential expression of both commonly 
associated biomarker genes is predicted with 
significance of <0.05 threshold. 


3.3 Functional analysis of NRP2 associated 
significant genes 

We identified the functions of 46 significant genes of 
both up regulated and down regulated genes were 
analyzed using GOrilla online database. The twelve 
up regulated genes are functionally involved in cell- 
differentiation, cell development and disease 
progression with in Breast cells. The dysregulation of 
RAB40A, FGFR1, TPM1, NRP2, CLMN of p-value 
2.46E-5 is significantly involved in different 
pathways with in cells of breast. 
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4. Conclusion 

We have predicted the gene expression of NRP2 gene 
regulation in Breast neuronal cell development that 
leads to cancer. The ductal, exocrine and endocrine 
cells of neurogenesis of expressed datasets predicts 46 
genes of up regulated and down regulated that 
potentially signified in Breast cancer and also 
associated with other significant cancer types. Based 
on this analysis the genes RAB40A, FGFR1, TPM1, 
NRP2, and CLMN are down regulated in Breast cells 
and also with different cancer types is potentially 
predicted best biomarkers to understand Breast cancer 
and associated cell types. Our results are helps to 
predict the gene regulation networks of other 
significant gene regulatory networks is helps to 
predict best drug targets and is used for drug 
discovery. 
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